Access Support Tree & TextArray: A Data Structure for XML Document Storage & Retrieval
Abstract
The characteristics of XML documents require new ways of storing and querying them. Queries on both textual content and structural aspects must be supported efficiently. For this reason, we examined existing work on document storage approaches and on models for querying documents, and derived the requirements that are essential for storing XML documents. As a result of this study, we designed the Access Support Tree and TextArray (AST/TA) data structure. The central idea of the AST/TA data structure is to separate the (logical) structure of a document from its "visible" text content, which is represented as a single contiguous string, while keeping the two tightly integrated so that changes remain consistent. We introduce the AST/TA data structure formally through its abstraction, the AST/TA model, and compare the requirements of our AST/TA approach with those found in current work. Finally, we demonstrate the superiority of the AST/TA model by describing the operations that take advantage of the design principles of the AST/TA data structure.
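The separation described in the abstract can be illustrated with a minimal sketch. It is not the formal AST/TA model from the paper: the names Node, Document, find_text, insert_text and build_example are hypothetical, and the rule for inserts that fall exactly on a node boundary is an assumption made for this illustration. The sketch keeps the visible text as one contiguous string, lets each structure node refer to a slice of that string, and updates the node ranges whenever the text changes.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """Hypothetical structure node covering text[start:end] of the shared string."""
    tag: str
    start: int
    end: int
    children: List["Node"] = field(default_factory=list)


class Document:
    """Illustrative pairing of a structure tree with one contiguous text string."""

    def __init__(self, root: Node, text: str):
        self.root = root
        self.text = text

    def text_of(self, node: Node) -> str:
        """Return the visible text covered by a node."""
        return self.text[node.start:node.end]

    def find_text(self, needle: str) -> Optional[Node]:
        """Search the contiguous text, then map the hit back to the deepest covering node."""
        pos = self.text.find(needle)
        if pos < 0:
            return None
        return self._deepest(self.root, pos, pos + len(needle))

    def _deepest(self, node: Node, start: int, end: int) -> Node:
        for child in node.children:
            if child.start <= start and end <= child.end:
                return self._deepest(child, start, end)
        return node

    def insert_text(self, pos: int, s: str) -> None:
        """Insert s at pos and adjust every node range so tree and text stay consistent."""
        if not 0 <= pos <= len(self.text):
            raise ValueError("pos is outside the text")
        self.text = self.text[:pos] + s + self.text[pos:]
        self._shift(self.root, pos, len(s))
        # Assumption: the root always spans the whole text.
        self.root.start, self.root.end = 0, len(self.text)

    def _shift(self, node: Node, pos: int, delta: int) -> None:
        if node.start >= pos:
            # Node begins at or after the insert point: move it as a whole.
            node.start += delta
            node.end += delta
        elif node.end > pos:
            # Insert point lies strictly inside the node: the node grows.
            node.end += delta
        for child in node.children:
            self._shift(child, pos, delta)


def build_example():
    """Build a two-element document; offsets are computed, not hard-coded."""
    title_text = "Access Tree"
    para_text = "Structure and text are kept apart."
    text = title_text + para_text
    title = Node("title", 0, len(title_text))
    para = Node("para", len(title_text), len(text))
    root = Node("article", 0, len(text), [title, para])
    return Document(root, text), title, para


if __name__ == "__main__":
    doc, title, para = build_example()
    print(doc.find_text("kept apart").tag)
    doc.insert_text(len("Access"), " Support")
    print(doc.text_of(title))
    print(doc.text_of(para))
```

In this sketch a phrase that spans an element boundary can still be found, because the search runs over the single string and only afterwards consults the tree. Text inserted exactly on a node boundary is attributed to the enclosing parent; a real implementation would need to choose and document its own rule.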
Similar resources
The XML Query Execution Engine (XEE)
The characteristics of XML documents require new ways of storing and querying such documents. Queries on both textual content and structural aspects must be supported efficiently. In this paper, we introduce the motivation and concept of our XML Query Execution Engine (XEE) and its current implementation. XEE provides a testbed for our Access Support Tree and TextArray data structure of which t...
A structure- and content-based multimedia information retrieval system for XML documents
Because the number of XML documents is dramatically increasing, we need to develop a multimedia information retrieval system which can support both the retrieval based on document structure and the retrieval based on image content. In order to support the structure-based retrieval, we design keyword, structure, element, and attribute index structures by indexing XML documents based on the basic...
Structure- and Content-Based Retrieval for XML Documents
Copyright © 2001, Idea Group Publishing. As the number of XML documents is dramatically increasing, it is necessary to develop an XML document retrieval system that can support both structure-based retrieval and content-based retrieval. In order to support structure-based retrieval, we design four efficient index structures, i.e., keyword, structure, element and attribute index, by i...
A DOM-Based Approach of Storage and Retrieval of XML Documents Using Relational Databases
This paper describes a novel approach to the storage and retrieval of XML documents using relational databases. In this approach, an XML document is decomposed into nodes based on its tree structure, and the nodes are stored in relational tables according to their node types. Our approach enables us to store XML documents using a fixed relational schema without any information about the XML schema or DTD. For the...
Apply Uncertainty in Document-Oriented Database (MongoDB) Using F-XML
As we move to a big data world where unstructured data is growing at high velocity, there is a need for a data store that can hold this volume of data. Relational databases have traditionally been used, but they are not suited to handling such large amounts of data, so a move to non-relational data stores is needed. In the current study, we have proposed an extension of the Mongo...